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EXECUTIVE SUMIARY 

The purpose of this study is to develop, a theoretical frame- 
work for unbiased estimation of the dynamic net impact of CETA 
on participants' earnings. The framework is developed for 
empirical application to the Continuous Longitudinal Manpower 
Survey (CLMS) . r ' 

"- "This paper presents a method of estimating CETA's effect, 
free of selectivity bias. The possibility of selectivity bias 
arises due to the non-random nature of participation in the pro- 
gram. That is, if participation is a function of unobservable 
variables (such as ability or motivation) , and these variables 
are also determinants of earnings, it wil^^ impossible to 
distinguish the effects of the urtobservables flom the effects of 
the program without controlling for the selection process. 

In this paper, selectivity biases are controlled for through 
the use of an erjor .components model. The model relies on the 
longitudinal nature of the data to eliminate the effects of the 
unobser^ables by differencing them away. The estimation techniques 
developed here allow for participation to be a non-random function 
of: individual, unobserved variables that are both fixed and 
changing over time, and temporary fluctuations in' earnings pti^r 
to the program. 

This framework represents an advancement in the, state of 
the art in impact estimation for employment and training programs. 
Further, i . ..ages fewer restrictions on the nature and type of 



comparison group necessary for unbiased estimation and, thereby 
contributes to solving a problem that has long plagued evalijfa- 
tions of employment arid training programs. 



I. INTRODUCTION 
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Federally funded employment and training programs have 
become a permanent and major feature of labor market policy in 
the United States. Most of these programs are now funded through 
the Comprehensive Employment and Training Act of 1973 (CETA) . 
By fiscal year 197 9 CETA funding had reached $5.4 billion. 

Despite the magnitude of employment and training programs, 
little is known about their effect on participants after leaving 
the program. Our inability to isolate program impacts is a 
result of the massive amount of data that are necessary. With- 
out specifically controlling for: time, social factors, demo- 
'-raphic variables, the economic climate, and unobservable factors 
^^sociated with each individual - it Is impossible to isolate the 
independent effect of the program. 

Recent availability of the Continuous Longitudinal Manpower 
Survey will eliminate many of these insurmountable data problems 
of the past. These date, contain a representative sample of CETA 
participants, well as comparison groups which have been drawn 
from the Current Population Survey. The availability of the 
comparison groups will permit an isolation of the effect of the , 
program on participants' earnings, independent from the effects 
•of time and the stat-e of the ^econQmy. The longitudinal nature 
of the data onabie an examination of the dyhamic impact of the 
program. Perhaps most importantly, the longitudinal data also 
provide, us witha much better opportunity to control for individi- 
al, unobservable characteristics than would fee available from 
purely cross-sectional data. 

■ , 1 - 



Ths plan of the paper is as follows. Section II describes 

techniques to control for selectivity bias. Section III develop 
consistent estimators for the net impact of employment and train 
ing programs on participants' earnings, allowing for the popsi- 
bility that participation in the program is correlated with indi 
vidual, unobservable characteristics that are both fixed and 
changing over time. These estimators are then generalized to 
allow for participation to be a function of transitory earnings 
fluctuations prior to participation. Section IV describes the 
data that will be used, and outlines the estimation agenda. 



II. METHODS FOR CONSISTENT ESTIMATION 

.increased awarenoas among economists as to tha pervasive- 
ness o. nonrandom selection has spawned a rapidly growing ^ 
literature attempting to deal with the econometric problems that 
it introduces. If the selection process is a function of un- 
observed variables that are correlated with the i dependent 
regressors, ordinary least squares will.^ield inconi stent 
parameter e.tima us. The literature, ti date, has developed two 
strategies for breaking the correlation between the error term 
(the unobservables) and the independent regressors." 

The first approach (developed by Heckman and Maddala an 
Lee) attempts to estimate the value of the unobserved (latent) 
variables by first predicting the outcome of the selection process 
using probit analysis. By assuming that the error terms are 



normally distributed, the expected value of the latent variables 

can be calculated by taking the inverse of the Mill's Ratio of the 
predicted value of the participation variable. By including this 
estimated value in the earnings equation, the omitted variables 
problem is eliminated. The resultant ordinary lea^t squares 
estimates are consistent. 

This approach has a great deal of appeal sin. .a adjust- 
ment for selectivity bias is based on a model of the economic 

- ,n to enter a program. Unfortunately, in the proh^ :m 
being studied here, there is a second se^-'Ction process in addi= 
■tion to self -selection. This selection is based on program, 
administrators' decisions about who will be permitted to parti- 
cipate in the program from the pool of eligible applicants. since 
this decision is likely to be a function of many non-economic 
considerations which are unknown t. an analyst, it is impossible 
to estimate the probit without introducing additional latent 

dude inconsistent estimates. 

Given this problem it seems likely that an error components 
model, the second method of controlling fot selectivity bias, 
is the appropriate technique. These models generally specify 
the latent variable as including both a permanent component 
which is associated with an individual, as well as a transi- 
tory component which is common across individuals. For the pur- 
pose of net impact eEtimation, the existence of a transitory 
component makes it necessary to have, data on both participants 
and nonparticipants. Otherwise it is impossible to isolate the 



^ independent effects of the program from that of the transitory 
error component. 

If the permanent component of the error (the latent varia- 
oie) is constant over time, then first differencing of the data, 
will eliminate any correlation between the remaining error and 
the independent regressors. if, however, the latent variable 
changes over time, then simple first differencing will not be 
sufficient to eliminate the correlation. The next section 
develops a model which allows for latent variables which are 
both fixed and changing over time, as well as transitory unob- 
served components. Consistent estimates are derived, allowiry • 
for the possibility that both tiypes of unobserved components are 



correlated with the independent 



regressors 



III. AN EARNINGS FUNCTION WITH CHANGING 
HETEROGENEITY AND TIME EFFECTS 

A unique characteristic of longitudinal data is that it 
. enables us control for individual unobserved .characteristics 
(heterogeneity) to an eKtent that is not possible using purely 
cross-sect ione^r data, a frequently used specification of earn- 
xngs determination is ■ 

(1) - BiXi + 02^.^ + p. + ^t + ^it 

Where is the earnings of the tth individual in time t, Xi - 

is a vector of background variables for the tth individual that 
remains fixed over time, and Z^^ isi a vector of variables that 

O 4 - 
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changes over time. The error term consists of the components: 
ej_ which is unique to the individual and fixed over time, 
which is common across all individuals at time t, and e^^ which 
is specific to individual i at time t. 

In recent work, Ashenfelter has used this model to estimate 
the impact of employment and training programs on participants' 
earnings. He assumed that Z^^ included a participation dummy 
variable and a cubic in age, and that s.^ was a random error term 
with zero expectation. Using longitudinal data on a sample of 
1964 MDTA participants, as well as a comparison group drawn from 
the CPS, Ashenfelter was able to estimate^ , ■ 

%t " ^is = (St = + 02^Zit - Zis) + (Eit - HsJ 

where s was the base (pre-program) year and t represents a series 
of post^program years. 

Any selectivity bias present in equation (1) due to correla- 
tion between the fi^d unobservables , e . , and the independent 
lOgressors, Z^^, has been eliminated from equation (2) by first 
differencing. So if participation in the program is a function 
of unobservables that are constant over time (such as innate 
ability), equation (2) will yield unbiased net impact estimates. 
If, however, participation in the program is a functiod^f indi= 
vidua!, unobservable variables that are not ,fix^fr-tj^ time, the 
e^t term, equation (2) will produce biased estimates since the 

ssumption that ^U^^ - = Zis^^. = 0 is violated. 

1 \ 

An example of an %uch a variable might be local labor market 
conditions. Since employment and paining programs nre generally 
funded at higher levels in areas experiencing unusually high 
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levels of structural or cyclical unemployment, participaticn is 
likely to be a funp'rt^n of local labBr market conditions. Since 
these conditions |re r^Lt fixed over time, any selectivity bias 
created by them will not be eliminated, .by,,. equation (2). 

It is possible to generalize the specification of the earn- 
ings function in equation (1) to allow for correlation between th 
latent variables that are not fixed over time (changing hetero- 
geneity) and program participation. By assuming that these 
variab;Les change over time at a constant rate 1), a generalized 
version of equation (1) is rewritten below. 

^ ^3) Yi]^ = 6iKi + 02Zik + Ei + + : 
' ^ (4) t^^ ^ PEik-1 + ^Lk . - : 

and (5) uij, ='Qek + Qnk ' ^ 

where Qgj^ is a random (transitory) error for program participants 
and is a random error term for non-participants. The sum of 

these two terms, u^j^ is a random error term with zero expectation 
and no serial correlation. has been written in this way to 

allow for the possibility that participation is a function of 
transitory fluctuations in earnings, prior to the prpiram. This 
would capture the possibility that cyclically une,mployed workers 
are more likely to participate in the prc^gram. The Q te^^could 
also incorporate the effect of "creaming",' If program administra- 
tors "cream" from the pool of eligible applicants, then 
E(Qqj^) < E(Qnk) , k < i 

, E(Qek) - E(Qnk) , k _ > * 

where t is the period o£ participation. This would mean that- 
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while the participants met the eli.gibility requirements for the - 
program, their eligibility was only teinporary because of an unlucky 

year prior to program participation. - By allowing for the possi- 
bility that E(Qj^) 4 0 prior to the program, any correlation be- . 
tween v^_^ and Zj^ can be removed. As we will see below, this is ' ' 
critical if p 4, 0. 

/ pr the sake of expositional simplicity, we will begin by ' 
assumng that E(Qj^) = o, for all t. Towards the end of this sec- 
tion, this assumption will be relaxed. The availability of an 
li . ■ . 2 

appropriate" comparison grouiJ is assumed. Also it will be 

assumed throughout that^nhe and p are constant oyer time, ^ 

although these parameters may vary between the participants and ■ 
non-participants. , 

* 

Using this assumption, equation (3) is rewritten below in 
first differences form. . • ■ '' ' 

^it - ^it-1 =, (Et - Et-lJ + ^2(Ht- Zit-l) + 

(Eit - Eit-l) ■ - . - ' 



1 - 



rhese requirements generally restrict participation in the 
program to, individuals who: have experienced a spell of ' 
unemployment, have very low earhings, or are welfare reci- 
pients , " • " , 

2 -.Techniques for choosing an "appropriate:" comoarison groups 
are described In th# next section. 
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The model is written here using the assumption that the 
parameter values are constant across the two groups This 
assumption can be easily relaxed. It is also possible to" 
generalize the model to allow^for a higher order autoregress- 
ivQ^scneme, although no such generalization will be reported 
assumition^^K*. ^^-'^ possible, -however, to reiaK the 
ilentificatJnn J " constant and simultaneously maintain 
xaentirication of prDg.ram ^impacts . ^^ -^^ 



f 



.The\ impact of ll:he- program on participants' earnings in period t 
^Ls captured in 35 •' ■^''Ji^, that the fixed heterogeneity (e^) has 
. been eliminated by differencing. If, however, the heterogeneity 
changes over time, then simple\ differencing will not completely 
eliminate the possibility of selectivity bias. Any remaining 
inconsistency can be eliminated by using equation (4) to rewrite 
the error term in eyuatlon (6). 

. Eit - Eit-1 = P(eit=l = eit=2) + ^it ^It-r 

/ , \ (^it-2-eiXi-e2Zit=2-ei-et-2)] + 
(Uit-Uit-l) 

^7) Eit - eit=l = P(et-2 - Et-l) - Ph^^Lt-l-^it-l^ + 
. . V , P(^it-1 - ^it-2) + (^it-^it-l) 

It is not possible to use (7) as^the basis for oonsistent estima- 
tion of p since chere is a, negative correlation of p between the 

error term and p (Y • -^.^^2)' This problem' cannot be eliminated 

5 ' 

by differencing. over two periods. To see ^his consider equation 

(8). \ \- , ■ ' 

\ ■ ■ ■ ■ " 

(8) Sit - E^^_2 = 'P^et=3-H-l) - P02(Zit=l-Zlt-3) + - 

P^^it-l-Yit-3) + (Uit-Uit-2) ' - 



a?v%?tJ^ iL t n^t-l = 0^ no comparison group would be necess- 

i ^ ^ intercept would, then measure the program' s impact 
hpld ^^""^"^^^ economy, however, this condition Is unlikely to 

5 - Thanks^ <due . to Gary ChamberUin for pointing out a mistake 
In this section of an earlier draft of the paper. ■ 
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From equations (3) and (4) We have 

(9) = 01^1 + 32Zit-l + £1 + et-1 + p^eit-3 + pUit-2 

By 'substituting equaiion (9) into (8) we would be left with a 
negative correlation of _between thi error term and P (Yit-l- 
Yit-3)- Higher ordej differencing would reduce the magnitude 
of the bias that is created by this correlation, but it would 
never be eliminated. 

The only way to consistently estimate p is by using an 
instrument, ^^^-1' ^it-1 in equation (7). By substituting 

equation (7) into (6) we are then left with 

. . ^10^, ^it - ^it-1 "■■ tet-Efl " P (et-l-Et-2) ] + 

. ^, , : . 62 CZit-Zit-1 - P(Zit-l-Zit-2)] "t.- 

P^^it,i"^it-2):=rl;uit - uit-i) 

" Now suppose that t is the period of program participation, 

and the program results in a shift in earnings of -0^ during 
.that time. Equation (9) can be rewritten as i 

.n. (11) .Yt - Yt-i = 0o + B2*^t* 0t P + P (?t-l-'^t-2) + 

■ ■ ' (Ut - Ut_i) 

Some notational simplification has been introduced here which 
will be maintained throughout. The % subscript: has been dropped. 
0o indicates a combination of transitory error components, 82* 
represents a non-linear combination of S2 and p, and Z^* repre- 
sents differencing of the Zt^ector, where no longer contains 
the participation variable. P is a dummy variable measuring pro- 
gram particiation, and 0^ is an unbiased estimate of the programs' 



- 9 - 



effect in year t. Program impacts in period t+1 are calculated 
below in a similar manner. ''" 

(12) Vt+i - - 0o + e2*2*t+i + {0t+l-pSt)P + P(h " ^t-i), 

In orddr to consistently estimate the cumulative impact of the 
program, Q^^^, it Is necessary to first estimate p from equation 
(11). Then multiplying equation (11) by p and adding it to (12), 
leaves . " 

(13) Yt+i - (l+p)Yt-pY^ - P'(^t-l-Yt-2)=^ Bo + B2*Z*t+l + 

^t+iP (i-p),tf:^ - pu^_-lJ 
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It is possible to continue to solve for the cumulative program 
impacts in later years in a similar manner. However, the left 
hand side of the equation becomes eKtremely complicated. A 
more; straight forward method is to, solve for the iinpacts 
. recursively, ..using the estimated values in period it+j) and' 
(ifr+^fl) .td solve' for the impact in period' (tH+2) Both solution 
methods are con s^if tent. The cumulative impact in any period oan^ 
be calculated by the use of equation (14). 

• (14) Yt^^n - Y^+n-l ='eo+32*Z*t+n + Bt+n^ + ' ' 

(ut+n " ^t+n-1^ \ ' ■ / - 

Once p, has been estimated from equation (11), and g^^^^^- ind/ ' 
Pt+n-2 have been solved for, it would then be possible to' estimate 
^t+n re-writing equation .(14) as / 
(15) Yt+n - Vn-1 ^ ^^^t+n-l^Bt+n^l ' Yt+n-2 + it+n-2) 
^o-^ ^*2Z*t+n + BttnP + (Vt+n-^t+n-l) 

- 10 - 



The-unique feature o£ this technique is that it provides us 

with a framework for consistent estimation of the dynamic impact 

of the program on participants » earnings under a very broad range 

of non-random selection processes. Equation (15) is eufficiently 

general to allow for participation to be correlated with latent 

variables that are both fixed and changing over time. Previous 

estimates have allowed for either fixed heterogeneity or changing 

6 

heterogeneity, but not both simultaneously. 

As was mentioned earlier, it is also important to consider 
the possibility that participation in the program may be correlated 
with the pre-program transitory error component, 'U^_2^. For 
instance, if program administrators "creamed" from the pool of 
eligible applicants, then E(Ut^x)<0 for participants,. This 
would create correlation between and participation in the 

program. Sinqe .>)t-i is an element of the error component in the 
net impact estimation equations developed here [see equation 
fl3)], this will create an additional source- of inconsistency 
unless specifically controlled for. Qnce it is recognised that the Q 
variable has the same' role in the pre-program years as the P 
variable does in the pQst-^program years , it follows that ^ equa- - 
tions (13) and (15): are sufficiently general for consistent net 



6 - Ashenfelter's estimates of equation (2) have eliminated , any 
bias from non-random selection due to fixed latent variables. 
He also estimated equation' (1) where 2^^ included lagged values 
of Y. This method is sufficient to eliminate any bias created 
from non-random, selection due to changing latent variables . It 
will not, however', eliminate fixed effeci;s bias. 

. / . ■ 

• - 11 - 
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impact estimatron in the cas& when Q^^j, 4 0.' For instance, if 
we knew the value of Q^^y^, we could estimate the net impact of 
the program in period t from equation (16) below. 

(16) - Y^-i ^'-p(Yt-^i-Y^_2)+e(Qt=l-Qt>2l = 0o+02*Zt*+ 
StP + (Ut=Ut^j) 

As in the case if net impact estima^on, where it was necessary 
to first estimate 0t before esrimatinD Pt+n' we must first 
estimate . In order to be able to do this, however, it is 
necessary k assume that Q^.j = 0, for j>B. To see this/ con- 
sider th3^ fc; ■owing nre-program earnings equation where Q is 

t-3 

not constra,i:\iid to be sero. 

\:^t-i-Qt-2"^J-3+&fe-3> + ^^t=l-Ut-2') ' ' 

Without assursima -hH'- ' ' ■ i" >■ ,'4. ■ 

J .-.ty .-n„u .,,^„.. x,.^ It is not possible to 

estimate g^^T^. Hovsvf?r by impeding this assumption, we are leff 
with . . ' ^ 

/ ■ (18) Y.^j^ .Vt=2==e^+32*z*t_j^4.p($^^^^^^ 

The residual from equation (18) gives us an estimate of Q 

. ■ t-1 
since e(^^_;L-u^,^3)=0. By including Q^_^ in the' net impact 

estimation equation, any bias that results 'from the program ' 



7 It IS assumed that p created a shift in the earning 's functiort 
. of participants after the program. The Q variab^S assumes the 

same role prior to the program. assumes the 

8 -. This simply rewrites equation (15). 



on the basis of the pire-program transitory error component 

has been eliminated. The moat general version of equation (15) 

is given below. - 

,.<19) Yt+j+2-Yt+j+lJP($t+j+l''yt+j)+P(%+j+l-R^^^^ - 

3o+62*2*tf j+3+Rt+j+2+Ut+j+2-Ut+j+l 
where = \ I >_ t 

• H = Si A i - t-1 



Rj = 0 



I < t-1 



Estimation of eqijation (19) will provide consistent ekti- ' 
mates under very general conditions. Its validity, however,\is 
dependent upon the following key assumptions: (1) p is con- \ 
stant over the entire time span being considered, (2) the earn- 
ings equations to be estimated are the same for the comparison 
group as for the particJ-pants except for the P and Q^terms, (3) 
Qt-j»0, for- Q>2, and (4) both P and Q enter the earnings equa- 
tions linearly. The validity of the first three assumptions Is 
easily tested. It i's p'^ossible to relax assumption (4) by doing 
separate^net impact e4timatiori^ for' the different -age/race/sex 
groups . . 



9 Unlike 3, it is, necessary to ftstimate\ Q for non-participants 
..^ as weJJ^- as participants. However, a ieparate value of 0 is 
estimated for each group. , 1 , , 

10- The analysis could he done separately fty whatever variables 
m X or Y seem likely to interact withJp or Q in the earnings 
equations. ■ ^ ■ ,,j i 
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Perhapi the most sorious drawback. of this , approach is that 
it iff not appropriate for very young participants of employment 
■ and training programs. iince very few of the youngest partici- 
pants would have any. eaj-nings for the three years prior 'to pro- 
gram participation, it is impossible to estimate p for this 
group. This is especially unfortunate since employment ' and 
t.rain ng programs generally have a large number of young parti- 
cipants. Nevertheless, a thorough examination of this problem 
is beyond the scope of the current analysis. The empirical 
., analysis will be done only for participants who were at. least , 
twenty- three years of age upon entering the program. 

It seems likely, however, that, the advantagos of this 
approach more than outweigh its shortcomings. It provides a frame 
work for consistent es,timation of the dynamic impact of employment 
and training programs on participants' earnings. The model 
allows for unobsorved variables which are both fixed (such as 
.ability), and changing over time (such as health or local labor 
market conditions) . These unobaervables, as well as the transi- 
tory error, component, may be correlated with participation in the 
.program. Pinaliy, the model is consistent with a very general 
.^specification of the earnings, function. ' it is" consistent with ' 
a model in which: (1), -lagged values of earnings ■ affect current' 
earnings (above and beyond the eJcfent to which they represent 



fixed effects) or (2) disturbances in earnings are correlated 
over time. /' . 



IV. DATA Km ESTIMATION .AGENDA 

The Continuous Longitudinal Manpower Survey' (CLMS) public 
use tapes will be used for the estimation suggested by Section 
III. The CLMS represents a major data development effort of the 
Department of Labor for evaluation of CETA funded employment ' and 
training programs. For a representative sample of 6,700 indivi- 
duals that participated in/%he program 4uring'1975 and 13,300- 
who participated during fiscal year 1976, the file contains: a fo 
year record of labor force experience beginning one year prior 
to CETA enrollment, basic demographic characteristics, a his- 
tory of public benefi^ts received by the individual, and/or the 
individual's family^,^; family-related variables, and reported ' 
annual social security earnings for 1951-1977.. For purposes • 
of comparison, thi March CPS (the annual demographic, file) "has ' 
been included with reported annual^ social security earriings and 
counts of quarter 3^^%^^ appended. 



11 In^ general, it is not possible to empirically. distinguish 
between these two models. See, Appendix a for a discussion 
of the necessary conditions for identification ' 
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Data collection and preparation have been carried out by 

the Bureau of Census. Westat, Inc. has been responsibla lor data 

• management, preliminary analysis, and development of comparison 

groups for the CETA participants from the CPS. The comparison 

groups have been generated by "matching" CETA participants with 

their CPS counterparts on a variety of socio-demographic and 

past earnings variables, using different priorities in the 

matching process. These procedures has been used to generate 

six (6) comparison groups for the 1975 partiaipants and three (3) 

comparison groups for the 1976 participants. 

The results derived in Section III suggest a framework for 

the empirical work. It is first necessary to estimate p for each 

of several years before and after the participation year. Next, the 

data should be pooled in order to estimate the aggregate value of ■ 

p-. This, should be done separately for the participa^its, the 

comparison group, and the two combined. Chow tests should be 

performed within each group, and across the two groups in order • 

to determine (1) the appropriateness of pooling p, and (2) the / 
. . . 13 = ■ . ■ ■/ 

appropriateness of the comparison group. At this point it will / 

be possible to re-estimate the pre-prograin earnings equations in / 

order to .estimate Qt- j . By including a dummy variable foi parti/ 

cipation in the pre-program equations', we .will be able to deteW 

... ' ' ^ r 

mine ifi^ E(Qt.j)-0, for j<b: This entire process will be done with 

/■ 

each comparson group in^ order to. determine which comes closest to 
matching the participants. / " 



12 - See. Appendix B for a consistent estimator of an aggregate p' 

13 - Note^that at is not necessary that 02 and- p are" the same across 
' thm two groups. The Chow tests can be done by constraining 

only the go term to be the same for the participants and nbn- " 
participants^ "'^^^^^ > ^ , " 
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APPENDIX A 



' Identification of Alternative Mod'fels 

For the purposes of this AppendiK, we- will be using the 
following specification of earnings determination^ 
, ',; (1). Yt = 6l^t-l + ejYt-a + .. 
„(2) ^ PiSt-l + P2H~2 + "t 

where is a random variable with an expected value of zero. 
We will consider the following cases i 



(A) 


02 = 


^1 " 


P2 ~ ° 


(B) 


■ = 


h = 


P^ = 0 


(C) 


.62 - 


92 = 


0 


(D) 


Pi = 


P2 = 


0 


(E) 


01 = 


02 = 


0 



First differencing of equatiGn (1) tor cases (h) - (Q) gives the 
following equations i , . , , 

(CI) Yt - = (6l+'Pl)l(Yt_i-Yt-2) - BlPl(Yt-2-Yt-3)+Ut-U 



1 - .The X and Z vectors lised in the text have been Excluded for 
sake of expositional, simplicity. . in general, adding them t( 
equation (1) does not affect any of the results. 



Equations (Al) and (Bl) are not : empirically distinUishable^. 

If we are unable to reject the null hypothesis that' gjp^ = 0, 
it, would then be necessary to geheralize the model in the text ' 
to allow for a second order autoregreasive error structure. 'This 
follows since it is imposaible to distinguish (D) , a second order 
autoregressive structure from (C) or (E) . To see this, equation 
(1) is written below in first differences form for (D) and (E) . 

(Dl) Yt = Yt=i = Si(Yt.i = Yt.2) + 02 ( Yt=-2-^t^.3 ) + ^t^Ut-i 
{ED Yt - Yt-i = Pi,($t-1 - + P2(Yt-2-^t-3) + ^t'^fl 

So if we are able to accept the null hypothesis that &ipi is 
zero in equation (CI), then we can conclude that either: (1) the 
specification of the earnints function should include a lagged 
value of the dependent variable, or (2) the disturbances in the 
earnings function follow a first order autoregressive scheme. 
If we reject the null hypbthesis, then we' can conclude that 
either J (a) the previous two alternatives are simultaneously 
true, or (b) the specif ication of.. the earnings function should 
include two lagged values of the dependent variable, or (3) the 
disturbances in the earnings function follow a second order 
autoregressive scheme. All of these ^Iternatives are consistent- 
with heterogeneity that changes, over /time. 

These findings can be generalized to handle more complicated 
^peci ficaltiohs^of eguatidna (^) -nan 



2 - If z changes over time in a non-trend like fashion, it may 
be possible to identify (A) from (B) through Z. in general 
not changas in/Z. (such as age or experience) will 

and 'Jlj to enable us to distinguish between (A) 
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APPENDIX B 



Consistent Estimation of Aggregate p 



Continuing t9 use the notation developea in the text. 

(1) Y^=6lX+e2Zt+Rt+H+et+P(Yt-l-BlX-e2Zt=i-Rt=a-ei-e^ 
Rewriting (1) , by first differencing gives , 

{2) Yt-Yt-l=[et--(l+P) et-l+pet~2.] + fPt " ■ Cl+P) Rt-l+PRt-2 
■ + 32 CZt-(l+P)Zt-i + PZt-2] + P(Vl-'^t-2^ 

Introducing some notional simplification 

(3) Yt-Y^.^ = H-.R^ + e^zt* + P(Vi - ^t-i) + u^u^^^ 

where is an intercept term common to the comparison ■group and 
the participants, and Rt is a . shift in the intetc^pl term for .parti- 
cipants. .Both of these terms are unique to period" t. i This suggests 
that if a pooled version ^of (3) wrire to be estimated, it would "ie 
necessary to allow for a separate intercept term for each year, as 
^well as for each of the two groups. The nature of the error struc- 
ture in equation (3) malies it impossible to obtain consistent esti- 
mates of an aggregate p by simpiy pooling (3) over time! To, see : . .. 
this, consider equations (4) -(7) below. Here the specific' 
yeirs that we are concerned with have been used in the subscript- 
ing^ ' , , , , . • 

^77-^76 = 677+^77+622*77+0 {Y-g=y75)+ U77-U7g ' ' 

^76-^75 " 876+R76+02Z*76+P^57S-y74) + U7g-U75 
^6) Y75-Y74 = 075+R75+B2Z*75+p(J74-Y73) "+ U75-U74 
(7) Y74-Y73 = B74+R74 + 022*74 + P (573-Y7.2).. + U74-U73 



Note that It IS not possible to pool any two consecutive equa- 
tions since this would create negative correlation between the 
.error term and P(Yn-Y^_^)/ In fact, no linear combination of^.^ 
equations (4) -(7) can completely eliminate this negative correla- 
tion. It is, therefore, necessary to estimate equations (4) -(7) 
as a system of equ^tior^s, imposing constraints across the equa- 
tions to obtain aggregate' estimates of p and Bj ■ Ordinary 
leait-squares estimates will be' consistent but inefficient. 
Zeiler Seemingly Unrelated Regression techniques will produce 
efficient as well as consistent estimates. 
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